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. - ABSTRACT , . '< 

... < •* % 

Log linear models are proposed for the analysis of structural relations 

m 

among multidimensional developmental contingency tables. -Models of quasi- 
independence are suggested "for testing specific hypothesized patterns of 
development. Transitions in developmental categorizations are described 
fey^Markov models applied to successive contingency tables. A, discussion 
of the role of Pearson chi square and'log likelihood significance tests in 
model selection is followed by two illustrative data sets. 
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Log linear models are a rejcentl y-devel oped* data analysis* technique 

• ■ , ,\ * y , . 

that provide a means for tne analysis of structural relations affl&ng multi- 
dimensional developmental contingency tables. Utilizing Piaget's (1970) 
definition of structure as a set of transformations governing-a>process , j 

of self regulation, hypotheses can be framed concerning synchrony among { 

• *- » 

/ 

developmental domains (Bates, Begnigni, Bretherton, Camiori 5 'and Voltera,^ 
1979; Wohlwill, 1973). *For example, one could explore patterns of re- 
lationship among perspective taking levels and other domains of social 
cognition such as friendship (jSelman, I980).or arriong several linguistic * 
and cognitive classifications (Bates et al . , 1979). In addition, changes ^ 
in structural relations over times as in Bates et al . (1979) discussion. 

7 - 

of local' homology models for examining relations 'b^t^feen gestural -and 
linguistic complexes, are also germane to Pi aget's 1 . concept of - structure. » 

The organization of this investigation is divided into four sections. 
In the first section, an outline of the log linear model is presented as 
well as the rationale for tests of significance. The nex't section discusses 
developmental hypotheses with respect to developmental contingency tables. 
-Thirdly, log linear approaches to developmental contingency tables are 
presented. The fourth section follows .with two illustrative examples. 

Log Linear Models 

Log linear- models are structural models, .describing cross pl^assi f i ed ^ j 

data. The complexity of the data is^efl^cted by the number of parameters . • 

♦ 

in the model describing its structure (Bishop^ Fienberg, and Holland, 1975). 
For a two way contingency table of order I (i - 1,2. . .1) by J (j = 1,2. . .J), 
the logarithm of the expected count in the Mij-th ,ce11 is written in 
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0) where log represents the natural logarithm of th^Jexpected values 
m and the u parameters are, analogous to their counterparts in an .analysis 
of variance model (see Bishop, Fienberg, and Holland, 1975; Everitt, 1977; % 
^Vienberg, 3977). 

(1) logM.. = u + u 1(i}s+ u 2(j) + u 12(ij) . 

The analogy to analysis of variance is made obvious by inspection of -the 
maximum likelihood parameter estimates presented in (2), (3), and (4)i where 
, 1.. = log m.. for all i and i . 4 ^ 

(3) u u -\ = (1 . /J) - u where^l . = X L • andos the mean for each 

each row, j 

" * (4)u 12(^) = 1 ij " ( u + u l(i) +u 2(j) ) 

♦ 

Jhe nfaximum likelihood estimates presented in (2), (3), and^ (4) are Equivalent 
across a wide spectrum of sampling assumptions, ranging from the assumption 

o~f NN Samp1e ^ize as a random* variable (f>oisson sampling) to the assumption of 

i 

fixed marginal configurations of sums or product multinominal sampling (see 
Birch, 1963). Marginal sums refer to the total sum for any row or column. 
Hierarchical Models * * , 



Log linear models are often arranged hierarchically to facilitate 
comparisons among models. Another model that couldapply to a two way table 
is written in (5), where the parameter estimate analogous to interaction 
is deleted. 

(5) logrr, . ^ + u 1(ij + u 2(j) # •* . 

_ . jtf *. - v 

Since the degrees of freedom for any. model are equivalent to the number of. 
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parameter estimates subj:ract£d from the total number of cells, the model 
in (1) is said ta be "saturated" whereas (5) is ^unsaturated" as ,it has 
fewer parameters than data cells. lit addition, . (5) is subsumed by(l)> as 
it contains all the parameter estimates of (1)," less one.' Thus, (5) and 
.(1) are hierarchically^ related. 
Notation and Hierarchical Arrangement > 

I^hierarCtfical- models are assumed, each parameter estimate can be 
expressed convienently by a variable number and a set of brackets, following 
Fienberg (1977), with the'entire model represented by the highest terms in 
the expression. Thus (1) is expressed as /J2J designating, "main effects" 
and "interaction" and (5) is expressed as / T/M 2/ , 'designating "main effects" 
without "interaction. " v • " * 

Goodness of Fit 



The relative adequacy* of a model can be tested by summary measures of 
the goodness of fit. The Pearson chi square statistic, presented in (6), % 
is asymptotically distributed as )C'Under tfts null hypothesis of independence. 
The log likelihood, ratio statistic, G , writtenVin (7) is also asymptotically 

^ with degrees of freedom appropriate "for the model 
under estimation, although the Pearson statistic has been demonstrated to 
follow the asymptotic X distribution more closely when the table is spufse" 
(Lar-ntz, 1978). ^ 

(6) X = i (Xj'* ^i^ /^i 5 where M represent observed and estimated 

t- * 

> • 1 r values, respectively 

(7) G 2 = 2 ^ .log (X./M.) 

Goodness o f Fit and Nested Hierarchies 
f — * ■ ■ - •— 

The log-likelihood ratio statistic, G / is minimized by the process of 
maximum likelihood estimation and can be partitiQned additively if models 
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are arranged hierarchically. Thus, the relative increment in selects pn$, 

V' ' % ? * J 

model rather than another is indexed by^ the. change in the G statistic / 

(Goodman, 1969). For example, if two linear models are r>ested, Wherein 

mo'de^ twp has only a subset of the parameter estimates (u-terms) contained 

2 

in the first model, the G statistic for- the second model is presented in 
18), and represents a partition into a measure of the distance of 'the ' 
parameter "estimates ' ^ k 

(8) G 2 (2) = G 2 /T2)/(117 + G 2 (l) 

4 k ' * 

of model '(2) as compared to model (1),^ as well as the distance of the parameter 
estimates fo? the first model (Bishop, Fienberg, and Holland, 1975; Fientrerg, 
(1977). Model fitting, as in quantitative analysis (Joreskog, 1.978), is a 
process of trading simplicity of expression for explanation of all of the 
observed data. The addition of parameters will .inevitably increase the 
goodness of fit of the model to . the observed data, but the incremental in- 
crease in fit obtained with additional parameters, especially for nested 
models, is apt to be more important than. accounting for all observed fre- 
quencies. Th^. latter, accounting^ for ap observed frequencies, is possible 
because log linear models are not stochastic} 1°9 linear models do not 
explicitly represent errors of measurement etc. * 

Quasi -Independence , ' 
Although the log linear model Allows for a wide variety of structural 
models, often an investigator h^s hypotheses concerning the specific cells 
that should contain the observed^counts. By fixing these "hypothesized" 
cells to have an a priori value of zero, termed" structural zeroes rather than 
sampling zeroes (Bishop et al. , 1975) , one could then expect the remaining i 
observations- to be independent of variable classification. The degrees of 
freedom appropriate for a test statistic under the model of quasi independence 



.is presented in (9), where V is the number of degrees of freedoijr usually 
associated with the model for the*comp.lete case, Z g is the number of cells 
with structural zeroes,* and'Zp is the number of zero Entries in the expected- 
marginal configurations. K - ■ \ 

t (9) DF = V - Z e m- z p ' 

Developmental Contingency Tables ♦ 
Developmental processes can be structured in terms of continuity and 
change in # the organization of behavior./ Continuity in organization is 
often 'framed as a stage, where a stage is assumed to represent a benchmark 
or prototypical organization of behavior (Feldman," 1980). Change is 
ascribed to stage transitioned occurs in tire context of a general, model of 
equilibration (Pfaget, 1970). One concern for developmental psychology is . 
whether or not an individual's organization., or structure, is consistent 

4 * ' 

across domajns (cognitive, social) or .tasjcs (object permanence, means-end). 
In addition to hypotheses concerning relative stage synchrony or asynchrc^, 
a second dimension of structure concerns patterns of change in classification 
over time. 

Typically, particular hypotheses about stage synchrony are operational ized 

as expectations of all counts in an I x J contingency table, wfrere I n / 

(i = 1,2. . .1) indexes the number of rows and J (j = 1,2. . .J) indixes k 

the number of columns* Table 1 represents the general I £ J. table. As in 

(6), X.. denotes an observed count, and "event" can denote domains, tasks, t 
ij * • • 

or 'times of observations (Hoffman, 1 980). 

— ^ 

Insert Table 1 herje 



:/ ■ ■ . . . v 

Develcfprnenta-l Models: Two Way Tables • 

- . Wohlwill (1973) has proposed several possible* patterns of observed / 

" i | 

frequency counts. Model I -corresponds to an* interpretation of Pi^get's * s 

*tb^ory that assumes synchronous mental development for different domains or ] 

• ta$ks. For a two way table, one woul'^d expect the diagpnal entries to 

contain all of the observed. values; off diagonal entries are-treated as errors $ 

of misclassification. The entries in Table. 2 , .indexed by I, represent the * 
i 4 

pa'ttern of expected counts for model I. 



, — .-j 

. Insert Table 2 Here ' • ^ 1 

* » * 

* ; 

C~ — —— — — — ~" — — — — — — — — — _ 

( A second model proposed by Wohlwill was the triangular hypothesis of 
decalage. For example, if event one is^a pfequisi te* for a second §vent (Ha), 
or if the first event is a preintervention observation and the second, a postr 
intervention observation (lib), one could expect observed entries to reflect ft 
a predominance of off diagonal cell entries as Well as the diagortel entries 
of Model If The cell entries for model I la are below the diagonal and 
conversely, are above the'diagonal for mode^.IIb, as-presented in Table 2. 
Entries that are not in cells designated by the triangular 'hypothesis are 
considered to be a product of errors *j n measurement, * 
Developmental Models: Mul tidimensionalvT^bles v 



Previous efforts have focused* ori structural analysis of the two way T 

developmental^table (Hoffman, 1980; Tholras, 1977). 1 However, restriction to 

the two way cross clas'sifi cation is pcrtentially.misleading, especially for a 

» » * * * 

complex* data set. The problem -Ps* analogous to the effects of partial 
correlation in quantitative analysis; two variables mciy appear to be related 
due to a common relationship with a third variable. In addition, questions* 
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♦concerning (a) changes in multidimensional cross classified structure over 
time or (b) Structural relations among, several domains, remain unanswered by 
reliance or saries of two way tables. To illustrate, developmental models 

for a three way table are presented next. Table 3 represents the genera-l 

P ' " - ' ' 

three dimensional developmental cdntingensy table. u As 'in the two way 

case ; I (i = 1,2. . .1) indexes rows „ J (j = 1,2, . .J), indexes columns, 

.and for the three dimensional taUe, K (K = 1 , 2. . .K), indexes "slices. " 

*ijk ^ nc * exes the observed counts. / 

_ ; ( • 

< Insert 1 Table 3 Here" ' - 



• - Log Linear Models for Developmental - Relations * 

Several models "of relatien among events are conceivable;' 'three- are • 
proposed for illustrative purposes. ModeV III is a "diagonal" hypothesis of 
complete synchrony among domains or tasks, corresponding to Model I in the' 
two way array. If Model III is assumed, expected counts should' be localized 
in the cells on the djJagonal of the three way array, X T lt/ for al 1 i = j = k, 
and assuming, for simplicity, I = J = K. 'For a three way array, the 
saturated model, /T 2 '37, is presented in (10). 

(10)' log M. jk = u 1(i) + u 2(j) + u 3(k) + u 12(ij) + u 13(ik) + u 23(jk) + 

*» 

. . U 123(ijk) 

If model III represents the true state of events, then the application of' 

* 

structural ^eroes to the diagonal cells, followed by an appropriate adjust- 
ment iii the degrees of freedom, as noted in (9), results in a model of quasi-* 
independence among the three events; fjl jll /I/, which is presented in (11). 

m ^ m tjk = u l(i) + u 2(j) + u 3(k) _ ' 
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Model IV < 

Model IV, "block diagonal" ttr partial sypohrony, might arisen in the 
case where'eyents two and three are synchronous' But exhibit decalage. Wi th 
respect to" the f/i r-st event. This model corresponds to a variation .of the 
triangular hypothesis of the two way -table. For example, £he firs^'event*^ 
could be a prerequisite for events two and.theee, which,, in turn, are • 

Synchronous. -For. I'= J = K, observed counts should be localized in X..,. 
cells where i = j = k and i>(j = k). Again, the model of quasi independence 

i 1)1 Uf Z.3/, could be applied tjp the three way array, to test-the feasibility 
of model IV. • * m 

Model V ' ^ ' "■ : v 



Model' V is a model for conditional independence among developmental events 
For "example,, if process C is 'assumed to underlie response patfeerr/s A* and B, 
then levels of A and B s^oi/ld be conditionally independent for each fixed ■ 

value of C. v Conditional independence is analogous to the«concept of -partial* 

* & * 

correlation in quantitative, analysis, in that the partial correlation between 
two variables may be small if the effects of % a third variable are accounted 

for. For example, if perspective taking is presumed to be the process by. 

* • * « # • 

which social structure evolves, then tone might expect liigh relationships , 

among measures of perspective taking arid structural relations^ among other' 

domains of social cogni^fbn. father than focusing on relative -synchrony 

among domains,, conditional independence models could represent a test for the 

relative contribution of a process, e:g., persp v ective*taking, to a set of 

* ' ' t 

response relationships. -For a, three way array*, a model of 'Condi tjonal 

• • 

independence fbr B and C, £gnsidered 'as the sepond and third variables, - 
respectively, is presented in (12). Note that interaction terms^containing 
AI2/ have been assumed to' be null for (12); thus, (12) is nested within (10)." 

f J 
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■ (12) l0 9. m ijk = u l(i) +u 2(j) + u 3<k) * u 13(i-,k) +u 23(jk) • '., ''. 
If models describes the data structure adequately, the G 2 statistic shoule 
not be significant at the chosen <*L level. ' 
Transitions ,in Time: Contingency Tables and Markov Models • - 

In the course of an intervention study, one might assess the. stability 
of change over time. If changes ire state can 'be displayed a-s contingency 
tables, varying from time 1 to time T,. then the process can be' modeled by 

v * % 

Markov 'chairfs, where each individual is classified at e4ch time. -Markov 
Chain .mbdels for cross classified data have two components— an* initial * 
probability vectof* of category^ probabilities and'a matrix of transition 
probabilities* denoting the probability o'f category stability or category 
transition across time. The initial probabilities will be regarded as a 
reference point, so further disqussion will concern^the matrix of transitio 
9 probabilities. # 

'Transition^probakilities are easily estimated'from cross classified \ 
data. Tablp 4 presents a poll of voting preference and is taken from 
Bishop et al.,0975, p. 259 ), The four transition probabilities are -estimate 
by (13) and a cogent derivation is presented in Bishop et a.l,. (1975).' 

(13 > V V x i + ' ' • • ' . , 



' Insert Table 4 Here 



Markov Models and Stationari t.y * * . - 

The order' of a Markov ^chain refers to the assumptions* concerning .the 
underlying- process of change). If the state occupied by an^indi vidua! at tin 
t depends only on his state at time t-1 , *the process is'saod to, be first . 



order. Restricting attention to the first order chain,, ff t-he transition 
probabilities ar&* independent of time, then the processes described as 
stationary , r$gj^d/ess ef order. In ojther words, the 'iVtfli vidual* s. responses 
are stable across thettime interval sampled. 
Log Linear Models and Markcjv Chains * * 



If an. event h§& l;.c£Ptegories and is measured at T times (t = 0,3. . .T), 
•f - / 1 > 

the statioparity of the' transition probabilities is tested "by* first arranging f 

the data into an I x I x T table, where each of the T tfarfsition arrays, 'of 

order I x I, is a layer in the I x I x T taWe. If time is th'e third variable, 

the modeVof conditional independence presented, in (14) should fit the 

observed count's in the I x I x T table. Conditional independence indicates , > 

^that time and,ofor example, voting -preference., are independent, conditional 

on initial voting preference. Similarly, one could envision intervention 

. Studies where the process is stationary for the control group, but notjfor 

V s ' 

the treatment group. - 

(14) log,M. jk = u + u 1( : } + u 2(jX + U3 (k) - + u 12(1j) *u 13(1| ^ 

I - * . * '4 

Illu strations _ • - k 



i m Two illustrative example*;, are provided to highlight the appl'%oation of 
log linear models to developmental contingency tables. 
Two Dimensional Array - 

Table 5 contains* an I x J tabl'e, 1 adapted from WohlwUl Q$73, R. 220), # - 
of synchronous progression between levels of concept A and levels of concept B, 
The ,appl ication of Model I to these data requires fixing the diagonal* entries, 

* * • * 



Insert\Ta'ble 5 here 
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where* i = j, ro zeVo and testing the log linear model of quasi independence 

\ \ " 

Model I la , the triangular hypothesis, involves structural zeroe? 

r ' 
w • *V — % 

^ "for all X. . where *i >j, as'well as for the diagonal entries', i = j % The - 

model of quasi independence is then fitted to the entries in the table, 

2 4 
Table 6 presents the G statistic, with accompanying degrees of freedom* 

for the model of complete independence^ followed by models of quasi 

independence (I,* Ha). The mdfbl of synchrony appears to fit these data 

best as evidenced by inspection of Table 6 -and the significance of the -» 

' difference 'statistic; G 2 (FULL) - G 2 (I) = 167.63 with 5 df. 



Table 6' insert: here* 



Three Dimensional "Array ^ * * 

An I x J x K table of artificial data- is presented in Table 7/ with 



Insert Table 7 here 
\ 



J. = K = 3. Model - HI, complete sync^6ny,'is a test of the log linear 

f 

model of quasi independence, with structural zeroes for the diagonal cells, 



e i = j = k. The G 2 statistic for. Model' III is 35". 88 withal? df. In 



contrast, Model IV, partial synchrony, involves a test of quasi independence 
with structural zeroes as .in Model III, in-addition to cells where i (j = k) 
(X211, ^3ii> ^322^'* ^ n€ \^ statistic for this version of Model IV yields a 
G 2 statistic of 29.64 with 14 df. The difference statistic, G 2 (III) - • 

* ^ * 

2 ' 
G (IV), .is 6.34 with 3 df, >which is not significant at cJL = .05, suggesting 

that Model III may be more appropriate for this* dalba set, 

N ■ 1/ 
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" o 9 . ' 

The log linear model is appropriatejfor the analysis of multi- 
dimensional; developmental contingency tables. Several models of develdp- 
ment can be postulated concerning relations among events; changes over 
time, or some combination thereof. Mociels of quasi independence allow 

v 

for point hypotheses testing while utilization of modular hierarchies may 
help investj gators to formulate clearer hypotheses in relation to prior 

expectations, particularly in relation to the relative contribution, as 

2 V a 

indexed by the G statistic, of each event to the total gestalt. Thd*^ 

intention of this presentation was to establish a script for the analysis 

of developmental relations, while allowing. for variation and extension 

to new themes by^othetk investigators,. 
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Table 1 



EVENT ONE- Stage 1 
^tage 1 X 

Stage 2 
, t 

» 

Stage I & X 



EVENT TWO 
Stage 2 



Stage 3 - 




Table 2 
EVENT TWO 



EVENT ONE ; 


Stage 1 


Stage -2— 


Stage 1 


* r 


lib • 


Stage 2 
i 


I la . 


i 1 


Stage I 


Ila 


J la 




a 

4 
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Event ONE 



Stage 1 



Stage 2 



Stage, I 



Stage 1 



Stage 1 — K 



Table'3 

EVENT TWO 
Stage 2 -■ 
EVENT THREE 
Stage 1 — K 



-Stage J 



Stage 1-— 




First Poll 
Same * 

Different 

''■Totals 



Same 



11 



72 



X+1 



O 



Table 4 

( ■ 



Second Poll 



Different ' - Totals 
XI* 



12 



s 21 



X+2 



X2+ 
IC 



.19 



> 

A 

* < 






f 

5 - • ' 


• 


- . 




• 


i 
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Table 5 

/ 

Levels of "concept A 






0 

• 


} 

/ 

s / 


Levels of concept B 




- 2 3 


4 


5 
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1 


18 


2 0 


0 


ft* 








. ' 2 ' 
3 


2 
0 


16 2 
2 16 


0 
2 


0 
0 


> 


1 




• -4 


0 


0 2 


16 . 


2 


• 




• 


5 


0 


0 0 


2 


J 8 








1 




— : T- 










<• 
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J 


• 




• 
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V 

< 




* * 

Model k 1 
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'Table 6 
G 2 




< 


Df 


\ 


• 


1 Complete Table 


183.3' 




- 


16 






I *' 

'•^X Ha 




*- ''15.67 

s 

10.15 






11 
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Table 7 

EVENT TWO 
% 1 2 3 

EVENT THREE 



, ' ' Event One 


1 


2 


3 


1 


2 


3 • 


1 


2 


3^ 


r 


21 


2 


2 


4 ' 


3 


3 


1 


4 


4 
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2 


7 


■5 


3 


3 


30 


SA. 


2 


4 


' 5" 


3 


6 


2 


1 


3 


7 


*3 


1 


4 


26 



